Toward Automatic Character Identification in Unannotated Narrative Text

نویسندگان

  • Josep Valls-Vargas
  • Santiago Ontañón
  • Jichen Zhu
چکیده

We present a case-based approach to character identification in natural language text in the context of our Voz system. Voz first extracts entities from the text, and for each one of them, computes a feature-vector using both linguistic information and external knowledge. We propose a new similarity measure called Continuous Jaccard that exploits those feature-vectors to compute the similarity between a given entity and those in the casebase, and thus determine which entities are characters or not. We evaluate our approach by comparing it with different similarity measures and feature sets. Results show an identification accuracy of up to 93.49%, significantly higher than recent related work.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Toward Automatic Role Identification in Unannotated Folk Tales

This paper presents an approach for automatically identifying high-level narrative structure information, particularly character roles, from unannotated folk tales. We introduce a new representation called action matrices to encode Propp’s narrative theory on character role and their “sphere of action.” We tested our approach in a fully automated system (Voz) using a corpus of 10 folk tales. Ou...

متن کامل

Toward a Computational Model for the Automatic Generation of Character Personality in Interactive Narrative

This paper introduces an approach for the incorporation of interesting and compelling characters in automatically generated interactive narrative. The approach is based on the development of a computational model that enables virtual characters to have distinct and welldefined personalities. In this model, character personality is founded on the hypothesis that choices that lead to actions can ...

متن کامل

Building a Bank of Semantically Encoded Narratives

We propose a methodology for a novel type of discourse annotation whose model is tuned to the analysis of a text as narrative. This is intended to be the basis of a “story bank” resource that would facilitate the automatic analysis of narrative structure and content. The methodology calls for annotators to construct propositions that approximate a reference text, by selecting predicates and arg...

متن کامل

Narrative Hermeneutic Circle: Improving Character Role Identification from Natural Language Text via Feedback Loops

While most natural language understanding systems rely on a pipeline-based architecture, certain human text interpretation methods are based on a cyclic process between the whole text and its parts: the hermeneutic circle. In the task of automatically identifying characters and their narrative roles, we propose a feedback-loop-based approach where the output of later modules of the pipeline is ...

متن کامل

Author gender identification from text using Bayesian Random Forest

Nowadays high usage of users from virtual environments and their connection via social networks like Facebook, Instagram, and Twitter shows the necessity of finding out shared subjects in this environment more than before. There are several applications that benefit from reliable methods for inferring age and gender of users in social media. Such applications exist across a wide area of fields,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014